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Abstract 

We give a simple new proof of a theorem of Duquesne, stating that the properly rescaled 
contour function of a critical aperiodic Galton- Watson tree, whose offspring distribution is 
in the domain of attraction of a stable law of index 6 S (1,2], conditioned on having total 
progeny n, converges in the functional sense to the normalized excursion of the continuous- 
time height function of a strictly stable spectrally positive Levy process of index 6. To this 
end, we generalize an idea of Le Gall which consists in using an absolute continuity relation 
between the conditional probability of having total progeny exactly n and the conditional 
probability of having total progeny at least n. This new method is robust and can be 
adapted to establish invariance theorems for Galton- Watson trees having n vertices whose 
degrees are prescribed to belong to a fixed subset of the positive integers. 

Keywords. Conditioned Galton- Watson tree, Stable continuous random tree, Scal- 
ing limit, Invariance principle. 
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Introduction 

In this article, we are interested in the asymptotic behavior of critical Galton- Watson trees 
whose offspring distribution may have infinite variance. Aldous [I] studied the shape of large 
critical Galton- Watson trees whose offspring distribution has finite variance and proved that 
their properly rescaled contour functions converge in distribution in the functional sense to the 
Brownian excursion. This seminal result has motivated the study of the convergence of other 
rescaled paths obtained from Galton- Watson trees, such as the Lukasiewicz path (also known 
as the Harris walk) and the height function. In [2T], under an additional exponential moment 
condition, Marckert & Mokkadem showed that the rescaled Lukasiewicz path, height function 
and contour function all converge in distribution to the same Brownian excursion. In parallel, 
unconditional versions of Aldous' result have been obtained in full generality. More precisely, 
when the offspring distribution is in the domain of attraction of a stable law of index 9 G 
(1,2], Duquesne & Le Gall [9j showed that the concatenation of rescaled Lukasiewicz paths of 
a sequence of independent Galton- Watson trees converges in distribution to a strictly stable 
spectrally positive Levy process X of index 9, and the concatenation of the associated rescaled 
height functions (or of the rescaled contour functions) converges in distribution to the so-called 
continuous-time height function associated to X. In the same monograph, Duquesne & Le Gall 
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explained how to deduce a limit theorem for Galton- Watson trees conditioned on having at least 
n vertices from the unconditional limit theorem. Finally, still in the stable case, Duquesne [8] 
showed that that the rescaled Lukasiewicz path of a Galton- Watson tree conditioned on having 
n vertices converges in distribution to the normalized excursion of the Levy process X (thus 
extending Marckert & Mokkadem's result) and that the rescaled height and contour functions of 
a Galton- Watson tree conditioned on having n vertices converge in distribution to the normalized 
excursion of the continuous-time height function H exc associated to X (thus extending Aldous' 
result). 

In this work, we give an alternative proof of Duquesne's result, which is based on an idea 
that appeared in the recent papers |17[ IT9] . Let us explain our approach after introducing some 
notation. For every x G R, let [^J denote the greatest integer smaller than or equal to x. If 
/ is an interval, let 6(1, R) be the space of all continuous functions / — > R equipped with the 
topology of uniform convergence on compact subsets of /. We also let D(7, R) be the space 
of all right-continuous with left limits (cadlag) functions I — > R, endowed with the Skorokhod 
Ji-topology (see [5, chap. 3], [T3| chap. VI] for background concerning the Skorokhod topology). 
Denote by the law of the Galton- Watson tree with offspring distribution fi. The total progeny 
of a tree r will be denoted by C( r )- Fi x & G (1,2] and let (X t ) t > be the spectrally positive Levy 
process with Laplace exponent E[exp(— XX t )] = exp(t\ e ). 

(0) We fix a critical offspring distribution fi in the domain of attraction of a stable law of 
index 9 G (1,2]. If Z/i, C/g, . . . are i.i.d. random variables with distribution fi, and W n = 
Ui + ■ ■ ■ + U n — n, there exist positive constants (B n ) n > such that W n /B n converges in 
distribution to X±. 

(1) Fix a G (0,1). To simplify notation, set W< n ^ = (W"' (n) ,0 < j < \na\) where W"' (n) = 
Wj(r)/B n and W(r) is the Lukasiewicz path of r. Then for every function f n : ZL ara J +1 — >■ 
R + , the following absolute continuity relation holds: 

[f n (W a ^)\ C(r) = n]=E, [f n (W a ^)D^ (W LanJ (r)) | C(r) > n] (1) 

with a certain function : N — > R + . 

(ii) We establish the existence of a measurable function r a : R + — > R + such that the quantity 

Dn{j) — F a (j/B n ) goes to as n — > oo, uniformly in values of j such that j / B n stays 
in a compact subset of Rl. Furthermore, if H denotes the continuous-time height process 
associated with X and N stands for the ltd excursion measure of X above its infimum, we 
have for every bounded measurable function F : D([0, a], R) — > R+: 

N (F((H t ) < t < a )T a (X a )\ ( > 1) = N (F((H t ) < t < a )\ ( = 1) , (2) 

where ( is the duration of the excursion under N. 

(iii) We show that under P /t [- \((t) = n], the rescaled height function converges in distribution 
on [0, a] for every a G (0,1). To this end, we fix a bounded continuous function F : 
D([0, a], R) -)■ M+ and apply formula Q with f n (W a ^) = F (^H [ntj (r); < t < a) where 
H(t) is the height function of the tree r. Using the previously mentioned result of Duquesne 
& Le Gall concerning Galton- Watson trees conditioned on having at least n vertices, we 
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show that we can restrict ourselves to the case where Wi an ,j(r)/S n stays in a compact 
subset of M.* + , so that we can apply (ii) and obtain that: 



lim E M F —H lnt] (r); < t < a) C(r) = n 



= Urn E M If ^fr LntJ (r); < t < aj £>W (W LanJ (r)) | C(r) > n 

= N(F(H t ;0<t<a)T a (X a ) | C > 1) 
= N (F(H t ; <t < a) \ ( = 1) . 




(iv) By using a relationship between the contour function and the height function which was 
noticed by Duquesne & Le Gall in |9], we get that, under P M [ ■ |C(r) = n], the scaled contour 
function converges in distribution on [0, a]. 

(v) By using the time reversal invariance property of the contour function, we deduce that 
under P M [- \((t) = n], the scaled contour function converges in distribution on the whole 
segment [0,1]. 

(vi) Using once again the relationship between the contour function and the height function, 
we deduce that, under P M [- |C(r) = n], the scaled height function converges in distribution 



In the case where the variance of \i is finite, Le Gall gave an alternative proof of Aldous' 
theorem in [17, Theorem 6.1] using a similar approach based on a strong local limit theorem. 
There are additional difficulties in the infinite variance case, since no such theorem is known in 
this case. 

Let us finally discuss the advantage of this new method. Firstly, the proof is simpler and 
less technical. Secondly, we believe that this approach is robust and can be adapted to other 
situations. For instance, using the same ideas, we have established invariance theorems for 
Galton- Watson trees having n vertices whose degrees are prescribed to belong to a fixed subset 
of the nonnegative integers j 15] . 

The rest of this text is organized as follows. In Section 1, we present the discrete framework 
by defining Galton- Watson trees and their codings. We explain how the local limit theorem gives 
information on the asymptotic behavior of large Galton- Watson trees and present the discrete 
absolute continuity relation appearing in Q. In Section 2, we discuss the continuous framework: 
we introduce the strictly stable spectrally positive Levy process, its Ito excursion measure N and 
the associated continuous-time height process. We also prove the absolute continuity relation 
([3]) . Finally, in Section 3 we give the new proof of Duquesne's theorem by carrying out steps 
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Notation and main assumptions. Throughout this work 9 e (1,2] is a fixed parame- 
ter. We consider a probability distribution ([i(j))j>a on the nonnegative integers satisfying the 
following three conditions: 



on [0,1]. 
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(i) // is critical, meaning that SfcLo^M^) = !• 

(ii) /x is in the domain of attraction of a stable law of index 9 G (1, 2}. This means that either 
the variance of \i is positive and finite, or fi(\j, oo)) = j~ e L(j), where L : IR + — >■ IR + is a 
function such that L(x) > for a; large enough and lim^oo L(tx)/L(x) = 1 for all t > 
(such a function is called slowly varying). We refer to [6 J or [10, chapter 3.7] for details. 

(iii) fi is aperiodic, which means that the additive subgroup of the integers Z spanned by 

Mi) 7^ 0} i s n °t a proper subgroup of Z. 
We introduce condition (iii) to avoid unnecessary complications, but our results can be extended 
to the periodic case. 

In what follows, (X t )t>o will stand for the spectrally positive Levy process with Laplace 
exponent E[exp(— AX t )] = exp(tX e ) where t, A > and pi will denote the density of X\. Finally, 
v will stand for the probability measure on Z defined by u(k) = p,{k + 1) for k > — 1. Note that 
v has zero mean. 

1 The discrete setting : Galton- Watson trees 
1.1 Galton- Watson trees 

Definition 1.1. Let N = {0, 1, . . .} be the set of all nonnegative integers and N* = {1, . . .}. Let 
also U be the set of all labels: 

oo 

U=\J (N*) n , 

n=0 

where by convention (N*)° = {0}. An element of U is a sequence u — u% • • • Uj of positive integers, 
and we set \u\ = j, which represents the "generation" of u. If u = U\ • • - Uj and v = V\ • • -v^ 
belong to U, we write uv — U\ • • • UjV\ • • • Vf. for the concatenation of u and v. In particular, note 
that M0 = 0m = u. Finally, a rooted ordered tree t is a finite subset of U such that: 

1. G r, 

2. if v G r and v = uj for some j G N*, then k G r, 

3. for every u G r, there exists an integer fc u (r) > such that, for every j G N*, uj G r if and 
only if 1 < j < k u (r). 

In the following, by tree we will mean rooted ordered tree. The set of all trees is denoted by 
T. We will often view each vertex of a tree r as an individual of a population whose r is the 
genealogical tree. The total progeny of r will be denoted by £(t) = Card(r). Finally, if r is a 
tree and u G r, we set T u r = {v G U; uv G r}, which is itself a tree. 

Definition 1.2. Let p be a probability measure on N with mean less than or equal to 1 and such 
that p(l) < 1. The law of the Galton- Watson tree with offspring distribution p is the unique 
probability measure P p on T such that: 

1. P# =j]=p(j)forj>O, 

2. for every j > 1 with p(j) > 0, conditionally on {k^ = j}, the shifted trees T\T, . . . , TjT are 
i.i.d. with distribution P p . 

A random tree whose distribution is P p will be called a GW p tree. 
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1.2 Coding Galton- Watson trees 



We now explain how trees can be coded by three different functions. These codings are crucial 
in the understanding of large Galton- Watson trees. 
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Figure 1: 
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A tree r with its vertices indexed in lexicographical order and its contour function 
<u< 2(C(r) - 1). Here, C(r) = 26. 
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Figure 2: The Lukasiewicz path (Wuj(t); < u < £(r) + 1) and the height function 
(H luj (r), < u < C(r)) of the tree r of Fig. [XJ 



Definition 1.3. We write u < v for the lexicographical order on the labels U (for example 
< 1 < 21 < 22). Consider a tree r and order the individuals of r in lexicographical order: 
= u (o) < u{\) <■■■ < w(C(r)-l). The height process H(t) = (H n (r),0 <n< CO)) is defined, 
for < n < C{t), by H n (r) = \u(n)\. For technical reasons, we set Hk(r) = for k > C( r )- We 
extend H(r) to E + by linear interpolation by setting H t {r) = (1 — {t})i/^j (r) + ^j+iO) for 
< t < C(r), where {t} = t - [tj . 

Consider a particle that starts from the root and visits continuously all edges at unit speed 
(assuming that every edge has unit length), going backwards as little as possible and respecting 
the lexicographical order of vertices. For < t < 2(£(r) — 1), C t (r) is defined as the distance 
to the root of the position of the particle at time t. For technical reasons, we set Ct(r) = for 
t G [2(£(r) — 1), 2((t)]. The function C(r) is called the contour function of the tree r. See Figure 
[T]for an example, and [8j Section 2] for a rigorous definition. 

Finally, the Lukasiewicz path W(r) = (W n (r),n > 0) of a tree r is defined by W (t) = 0, 
W n+1 (r) = W n (r) + k u(n) (r) - 1 for < n < ((r) - 1 and W fc (r) = for fe > C(r). 

Note that necessarily W^( T )(r) = — 1. 

Let (W n ; n > 0) be a random walk which starts at with jump distribution = fi(k + 1) 
for > —1. For j > 1, define Q = inf{n > 0; W n = —j}- 
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Proposition 1.4. (Wo, W%, . . . , W^) has the same distribution as the Lukasiewicz path of a GW M 
tree. In particular, the total progeny of a GW^ tree has the same law as £1 . 

Proof. See [T%| Proposition 1.5]. □ 

We will also use the following well-known fact (see e.g. Lemma 6.1 in [22] and the discussion 
that follows). 

Proposition 1.5. For every integers 1 < j < n, we have F[Q = n] = -P[W n = 



1.3 Slowly varying functions 

Slowly varying functions appear in the study of domains of attractions of stable laws. Here we 
recall some properties of these functions in view of future use. 

Recall that a positive measurable function L : K + — > M + is said to be slowly varying if 
L(x) > for x large enough and, for all t > 0, L(tx)/L(x) — > 1 as x — > oo. A useful result 
concerning these functions is the so-called Representation Theorem, which states that a function 
L : R + K + is slowly varying if and only if it can be written in the form: 

(f x e(u) \ 
J duj , x > 0, 

where c is a nonnegative measurable function having a finite positive limit at infinity and e is 
a measurable function tending to at infinity. See e.g. [6, Theorem 1.3.1] for a proof. The 
following result is then an easy consequence. 

Proposition 1.6. Fix e > and let L : R + — > M be a slowly varying function. There exist two 
constants C > 1 and N > such that -^x~ € < L{nx)/L{n) < Cx e for every integer n > N and 
x>l. 



1.4 The Local Limit Theorem 

Definition 1.7. A subset A C Z is said to be lattice if there exist b e Z and an integer d > 2 
such that A C b + dZ. The largest d for which this statement holds is called the span of A. A 
measure on Z is said to be lattice if its support is lattice, and a random variable is said to be 
lattice if its law is lattice. 

Remark 1.8. Since /i is supposed to be critical and aperiodic, using the fact that /i(0) > 0, it 
is an exercise to check that the probability measure v is non-lattice. 

Recall that p\ is the density of X\. It is well known that pi(0) > 0, that p\ is positive, 
bounded and continuous, and that the absolute value of the derivative of p\ is bounded over M 
(see e.g. [24"1 I. 4.]). The following theorem will allow us to find estimates for the probabilities 



appearing in Proposition 1.5 



Theorem 1.9 (Local Limit Theorem). Let (Yn) n >o be a random walk on Z started from such 
that its jump distribution is in the domain of attraction of a stable law of index 9 G (1, 2]. Assume 
that Y\ is non-lattice, that E[Yi] = and that Y\ takes values inNU { — 1}. 
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(i) There exists an increasing sequence of positive real numbers (a„) n >i such that Y n /a n con- 
verges in distribution to X\ . 



(ii) We have lim sup 
n ^°° fcez 



a n F[Y n 



k] - pi C 



k 



(Hi) There exists a slowly varying function I : 



0. 



such that a r 



n 



n 



Proof. For (i), see [TTj Section XVII. 5, Theorem 3] and [6J Section 8.4]. The fact that (a n ) may 
be chosen to be increasing follows from [TU| Formula 3.7.2]. For (ii), see fT2\ Theorem 4.2.1]. For 
(iii), it is shown in |T2l p. 46] that a^ n ja n converges to k 1 ^ 9 for every integer k > 1. Since (a n ) 
is increasing, a theorem of de Haan (see [HI Theorem 1.10.7]) implies that there exists a slowly 
varying function I : IR + — > IR + such that a n = l{n)n 1 / 9 for every positive integer n. □ 



Let (W n ) n >o be as in Proposition 1.4 a random walk started from with jump distribution 
v. Since /i is in the domain of attraction of a stable law of index 8, it follows that v is in the 
same domain of attraction, and W\ is not lattice by Remark 1.8 Since v has zero mean, by 
the preceding theorem there exists an increasing sequence of positive integers (B n ) n >i such that 
B n — > oo and W n /B n converges in distribution towards X\ as n — > oo. In what follows, the 
sequence (B n ) n >i will be fixed, and h : M.+ — > IR+ will stand for a slowly varying function such 
that B n = h^n 1 / 9 . 

Lemma 1.10. We have: 

(i) P„[C(r)=n] 
where we write a n ~ 



Pi(0) 



(ii) P M [C(r)>n] 



n^oo n 1 / 6 ' +1 /i(n) ' 
6 n if a n /b n -)• 1. 

Proof. We keep the notation of Proposition 1.4 Proposition |1.5| gives that: 



n— »oo 



gpi(O) 
n l / 9 h(n) 



-P[W n = -1]. 



// 



(3) 



For (i), it suffices to notice that the local limit theorem (Theorem 1.9) and the continuity of p\ 

pi{ti)/(h(n)n 1 ' 9 ). For (ii) 



entail F[W n = -1] 



oo / 

Pm [C(r) > n] = W 

k=n \ 



we use (i) to write: 
I , , 1 



h(k)k 1+1 / 9 



Pi (0) + 



h^k 1 ^/ 9 



5(k) 



where 5(k) tends to as k — > oo. We can rewrite this in the form: 

/•oo 

/ duf n (u), 

where: 



fn(u 



/i(n)n 1/9 P At [C(r) > n] 
/i(n)n 1 / e+1 



(4) 



(P! (0) + <5(M))- 



1.6 



for 



/i( [nwj ) 

For fixed u > 1, / n (w) tends to J^/i+i as n -)■ oo. Choose e G (0, l/#). By Proposition 
every sufficiently large positive integer n we have f n (u) < C /u l+1 l 9 ~ e for every u > 1, where C 
is a positive constant. The dominated convergence theorem allows us to infer that: 

Pi(0) 



lim 

n— >oo 



duf n (u) 

and the desired result follows from Q. 



du- 



ll 



0Pi(O), 



□ 
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1.5 Discrete absolute continuity 

The next lemma in another important ingredient of our approach. 



Lemma 1.11 (Le Gall & Miermont). Fix a £ (0, 1). Then, with the notation of Proposition 1.5 
for every n > 1 and for every bounded nonnegative function f n on zL an J +1 ; 



E[f n (W ,...,W [nai )\( 1 = n 



E 



fn(W , . . . ,W [nai 



(Pn—[an] 

(w [an] + i)Mi(i) 



Cl>™ 



(5) 



where <f) p {j) = P [Cj — p] 0p(j) = P [Cj — p] / or every integers j > 1 and p > 1. 



Proof. This result follows from an application of the Markov property to the random walk W at 
time |_anj. See [T9l Lemma 10] for details in a slightly different setting. □ 



2 The continuous setting: stable Levy processes 

2.1 The normalized excursion of the Levy process and the continuous- 
time height process 

We follow the presentation of [8J. The underlying probability space will be denoted by (fl, iF, P). 
Recall that X is a strictly stable spectrally positive Levy process with index 6 £ (1,2] such that 
for A > 0: 

E[exp(-AX*)] = exp(tA e ). (6) 

We denote the canonical filtration generated by X and augmented with the P-negligible sets 
by (9 r t)t>o- See [2] for the proofs of the general assertions of this subsection concerning Levy 
processes. In particular, for 8 = 2 the process X is y/2 times the standard Brownian motion on 
the line. Recall that X has the following scaling property: for c > 0, the process (c~ 1//6 X rf , t > 0) 
has the same law as X. In particular, the density p t of the law of X t enjoys the following scaling 
property: 

Pt(x)=t- 1/e Pl (xt- 1/6 ) (7) 
for x £ R, t > 0. The following notation will be useful: for s < t, we set J t s = inf^ ^X and 
It = inf[o,t] X. Notice that the process I is continuous since X has no negative jumps. 

The process X — / is a strong Markov process and is regular for itself with respect to X — /. 
We may and will choose —I as the local time of X — I at level 0. Let (gi,di),i £ J be the 
excursion intervals of X — I above 0. For every i £ J and s > 0, set u\ = X( g . +S ) A ^- — X g .. We 
view id 1 as an element of the excursion space £, which is defined by: 

£ = {u £ B(R+, E); w(0) = and : = S up{s > 0; > } £ (0, oo)}. 

From Ito's excursion theory, the point measure 

is a Poisson measure on R + x £ with intensity dt~N(du), where N(c/cj) is a u-finite measure on £. 
By classical results, N(£ > t) = T(l — l/^) _1 t _1 / e . Without risk of confusion, we will also use 
the notation X for the canonical process on the space D(R + ,M). 



8 



We now define the normalized excursion of X. Let us first recall the ltd description of the 
excursion measure (see [7] or [2] Chapter VIII. 4] for details). Define for A > the re-scaling 
operator on £ by S^(uj) = (A 1 / 6, w(s/A), s > 0) . Then there exists a unique collection of 
probability measures (N( ), a > 0) on £ such that the following properties hold. 

(i) For every a > 0, N( a )(£ — a) — 1. 

(ii) For every A > and a > 0, we have S^(N^) = N( Aa ). 

da 



roo 

(iii) For every measurable subset A of £: N(A) = / N( a )(A) 

*t 



9T(1 - l/9)a 1 i 



H-i 



The probability distribution N(i) on cadlag paths with unit lifetime is called the law of the 
normalized excursion of X and will sometimes be denoted by N( • \( = 1). In particular, for 9 = 2 
the process X exc is a/2 times the normalized excursion of linear Brownian motion. Informally, 
N( • \( = 1) is the law of an excursion conditioned to have unit lifetime. 

We will also use the so-called continuous-time height process H associated with X which was 
introduced in [20]. If 9 = 2, H is set to be equal to X — I. If 9 e (1, 2), the process H is defined 
for every t > by: 

l r* 



H t := lim- / l {Xs< i ? + e }ds, 

where the limit exists in P-probability and in N-measure on {t < £}. The definition of H thus 
makes sense under P or under N. The process H has a continuous modification both under P 
and under N (see [91 Chapter 1] for details), and from now on we consider only this modification. 
Using simple scale arguments one can also define H as a continuous random process under 
N( • \( = 1). For our purposes, we will need the fact that, for every a > 0, (H t )o<t< a is a 
measurable function of (X t ) < t < a . 

2.2 Absolute continuity property of the Ito measure 

We now present the continuous counterpart of the discrete absolute continuity property appearing 



in Lemma 1.11 We follow the presentation of [T7] but generalize it to the stable case. The 
following proposition is classical (see e.g. the proof of Theorem 4.1 in [23, Chapter XII], which 
establishes the result for Brownian motion). 

Proposition 2.1. Fix t > 0. Under the conditional probability measure N( • |£ > t), the process 
{X t+S ) s > is Markovian with the transition kernels of a strictly stable spectrally positive Levy 
process of index 9 stopped upon hitting 0. 

We will also use the following result (see U Corollary 2.3] for a proof). 

Proposition 2.2. Set q s (x) = -p s {—x) for x, s > 0. For x > 0, let T(x) = inf{t > 0; X t < —x} 

be the first passage time of —X above x. Then P [T(x) G dt] = q t (x)dt for every x > 0. 

Note that q s is a positive continuous function on (0, oo), for every s > 0. It is also known 
that q s is bounded by a constant which is uniform when s varies over [e, oo), e > (this follows 
from e.g. [2H I- 4.]). 
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Proposition 2.3. For every a G (0, 1) and x > define: 

dqi-a{x) 

Then for every measurable bounded function G : D([0, a], IR 2 ) — > R + : 

N (G((X t )o<t<«, (H t ) < t < a )r a (X a )\ C > 1) = N (G((X t ) <t<a, (#t)o<t<«)| C = 1) • 

Proof. Since (H t )o<t< a is a measurable function of (Xt)o<t< a , it is sufficient to prove that for 
every bounded measurable function F : O([0,a],R) — > R+: 

N(F((X t ) 0<t<a )r a (x a )|c>i) = N(F((x t ) 0<t<a)\ C = i). 



be two bounded continuous functions and let 



To this end, fix r G [0, a], let /, g : 
/i : Rl — )■ R + be a continuous function. Using the notation of Proposition 2.2, we have: 



N(f(X r )h(X a )g(OMt>a}) = N(fXX r )l K>a} E[h(x)g(a + T(x))} x=Xa ) 

dsg{a + s)N (/(X r )/i(X a )g s (X a )l {?>a} 

dug{u)N(f{X r )h{X a )q u „ a (X a )l {c>a} 



where we have used Proposition 2.1 in the first equality and Proposition 2.2|in the second equality. 



Moreover, by property (iii) in subsection 2.1 
N(/(X r MC)l {c>a} ) = 



du 



9W 



Z7m' N («)(/W)- 



(9) 



ev(i - 1/6)^/°+ 

Now observe that (fsl) (with h — 1) and fcfy hold for any bounded continuous function g. Since both 
functions u \- > N mX r )g u _ a (X a )l and w h-> N( u ) (f(X r )) are easily seen to be continuous 

over (a, oo), it follows that for every u > a: 

N (/(X r )g u _ a (X )l {c>0} ) = flrn _ * 1/g+1 N (tl) (/(X r )) . 



In particular, for it = 1 we get: 

N(/(X r )g 1 _ a (X a )l {?>a} 



0r(i - l/e)^/ 
i 



0r(i - i/e) 



N(i) (/(X r )) . 



(10) 



On the other hand, applying pi) with g = 1 and noting that N(£ > 1) 



i 



r(i-i/<?) 



, we get: 



N(f(X r )h(X a ) |C > 1) = T(l - 1/0)N (f(X r )h(X a )l {c>a} r dsq s (X a ] 

^ J 1— a 



By combining (Jllj) and (10) we conclude that: 

eqi-a(X a ) 



N /(X r 



C>1 =N (1) (/(X r )). 



One similarly shows that for < T\ < ■ ■ ■ < r n < a and fi, ■ ■ ■ ,f n : 
bounded functions: 



— >• 



continuous 



N /l(X n ) ■ ■ ■ fn{X rn ) 



0qi- a {X a ) 



C>1 =N (1) (A(X ri )---/ n (X r J). 



The desired result follows since the Borel a-field of D([0, a],R) is generated by the coordinate 
functions X i— )■ X r for < r < a (see e.g. [3, Theorem 12.5 (iii)]). □ 
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3 Convergence to the stable tree 
3.1 An invariance theorem 

We rely on the following theorem, which is similar in spirit to Donsker's invariance theorem (see 
the concluding remark of [9j Section 2.6] for a proof). 

Theorem 3.1 (Duquesne & Le Gall). Let t n be a random tree distributed according to P M [ • | ((r) > 
n]. We have: 



I B 

-j^-W\ nt \(tn), — H nt (i n 



t>o 



(<0 

n— >oo 



(X t ,#t)o<t<i under N( ■ |C > 1) 



3.2 Convergence of the scaled contour and height functions 

Recall the notation n (j) = P [Q = n] and <f>* n (j) = P [Q > n). 
Lemma 3.2. Fix a > 0. We have: 



(i) lim sup 



l<k<aB n 



Tuf> n (k) - qx 



k 

B,, 



0. 



{%%) lim sup 



\<k<aB n 



as g s — 



0. 



This has been proved by Le Gall in [T7] when /1 has finite variance. In full generality, the 
proof is technical and is postponed to Section 3.3. 

Lemma 3.3. Fix a 6 (0, 1). Let tn be a random tree distributed according to P M [- | £(t) = n]. 
Then the following convergence holds in distribution in the space C([0,a],M): 



"■MtrO; < t < aj 



n— >oo 



(H t ; 0<t<a) under N( • |C = 1). 



Proof. Recall the notation T a introduced in Proposition 2.3 We start by verifying that, for 



a > 1, we have: 



lim I sup 

™ 1 \B n <k< a B n 



(Pn— [an] (fc + l)/0 n (l) 



C-Lanj(^ + !)/<(!) 



r„ 



B n 



12) 



We first note that by Lemma 1.10, 0* (l)/(n0 n (l)) — > 6 as n — > 00. Then Lemma 3.2 implies 
that: 



lim 



sup 

±B n <k<aB n 



(A; + l)/0 n (l) 



0- 



1-0 



fc+1 



fc+i 



n— [o.n J 







(13) 



provided we can verify the existence of a constant 5 > such that, for n sufficiently large, 

k 



inf 

±-B n <k<aB n Jl 



dsq s 



B 



> 5. 



n— [an\ 



This follows from the fact that, for every (3 > 1: 

POO 

inf / dsq s (x) > 0. 

\<x<BJ\ 
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Details are left to the reader. Our claim (12) follows from (13) using the scaling property ([7]). 

Now let F : D([0, a],R) — > R+ be a bounded continuous function. Fix a > 1. To simplify 
notation, if r is a tree, define the event A°(t) by A°(t) = {W|„ a j (r) G [^B n , aB n j } . We also set 
F n ( T ) = F (^-H^ntjfr); < t < a). Since (H (r), Hi(r), . . . , ff|_ an j(r)) is a measurable function 
of (W (r), Wi(r), . . . , W LanJ (r)) (see pi Prop 1.2]), by Proposition O Q and Q we have: 



lim 

n— too 



E [F n (t n )l 



E, 



-P 1 n(T r )l J 4^(r)r a 



w 



[an] 



fin 



C(r) > n 



By Theorem 3.1 the law of 



J) 



under P jU [- | C(r) > n] converges towards the law of ((H t ; < t < a),X a ) under N(- | ( > 1) (for 
the convergence of the second component we have also used the fact that X is almost surely 
continuous at a). Thus: 



lim E 

n— >oo 



^) 1 {W LnaJ (t 7l )e[iB„,aB n ]} 



N [F(H t ; < t < a)T a (X a )l {Xae[ha]} \ ( > 1 



N ( F(H t ; < t < a)l 



C = i 



(14) 



where we have used Proposition |2.3| in the second equality. 
By taking F = 1, we obtain: 



lim P 

n— >oo 



W [naJ (t n ) G 



1 



a 



N X a G 



a 



a 



C = i 



from the convergence (14) that: 



This last quantity tends to 1 as a — > oo. By choosing a > 1 sufficiently large, we easily deduce 

N( J P( J ff t ;0<t<a)|C = l). 



lim E 

n— »oo 



The path continuity of H under N ( • | ( = 1) then implies the claim of Lemma 3.3 



□ 



Theorem 3.4. Let i n be a random tree distributed according to P M [- | ((r) = n]. Then: 

( —H nt (i n ), ^C 2nt (t n )) ^ (H t , H t ) 0<t<1 under N( • |C = 1). 

\n n J 0<t<1 n^oo 



Proof. The proof consists in showing that the scaled height process is close to the scaled contour 
process and then using a time-reversal argument in order to show that the convergence holds 
on the whole segment [0, 1]. To this end, we adapt [HI Remark 3.2] and |9l Section 2.4] to our 
context. For < p < n set b p = 2p — H p (t n ) so that b p represents the time needed by the 
contour process to reach the (p + l)-st individual of t n (in the lexicographical order). Also set 
b n = 2(n — 1). Note that Cb p = H p . From this observation, we get: 



sup |C t (tn) - HpiQl < \H p+1 (t n ) - H p (tn)\ + 1. 

t£[b p ,b p+1 ] 



(15) 
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for < p < n. Then define the random function g n : [0, 2n] — > N by setting g n (t) = k if 
t G [fefc, &fc+i) and k < n, and = n if t G [2(n — 1), 2n] so that for t < 2(n — 1), #«(£) is the 
index of the last individual which has been visited by the contour function up to time t if the 
individuals are indexed 0, 1, . . . , n — 1 in lexicographical order. Finally, set g n (t) = g n (nt)/n. Fix 



a G (0,1). Then, by gSJ: 

B, 



sup 

' '[qnj 



t< 



C n t(i n ) 



n 



B,, 



n 



<— + — sup \H k+1 (i n ) - H k (i n ) 

n U k<\an\ 



(16) 



which converges in probability to by Lemma 3.3 and the path continuity of (H t ). On the other 
hand it follows from the definition of b„ that: 



sup 



t< 



'Jan] 



9n(t) 



1 B n , . 1 

£n n k<an n n 



towards as n — > oo. Using (16), we conclude that: 



3.3 



we see that converges 



by Lemma 3.3 Finally, by the definition of b n and using Lemma 
in probability towards 2a and that ^ sup i<2a H n ~ n ^(i n ) — H nt /2{tn) converges in probability 



B 



- SUp | C 2n t (in) ~ H nt (in) \ 

Tl 0<t<a 



0. 



(17) 



Together with Lemma [3. 3[ this implies: 

B r 



n 



' : C 2nt (i n ); 0<t<a 



(d) 



(H t ; 0<t<a) under N( ■ | C = 1' 



Since (Ct(i n )', < t < 2n — 2) and (C/2n-2-t(tn)j < t < 2n — 2) have the same distribution, it 
follows that: 



(^C 2nt (t n ); 0<t<l) 



(H t ; < t < 1) under N( • | C = 1). 



See the last paragraph of the proof of Theorem 6.1 in [T7] for details. 

Finally, we show that this convergence in turn entails the convergence of the rescaled height 



function of i n on the whole segment [0, 1]. To this end, we verify that convergence (17) remains 
valid for a = 1. First note that: 



sup 

0<t<2 



9n(t) 



< ~ SUpH k (t n ) + - = SUp — Cfc(tn) + - 

2n k<n n 2B n k <2n n n 



(P), 







(19) 



by (18). Secondly, it follows from (|15|) that: 

B ri 



sup 

0<i<2 



-C nt (i n ) 



n 



ng„(t) 



< 



Bn 










sup 


n 


n 


k<n-l 


B n 


B n 






+ — 


sup 


n 


n 


fc<n-l 



C b k+i(in) — Cb^ (in) 
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By (18), in order to prove that the latter quantity tends to in probability, it is sufficient to 
verify that sup fc<n b ^ ±1 — — converges to in probability. But by the definition of b n : 



sup 

k<n 



bk+l 


_ bk 


n 


n 


'fc+l 


h 


n 


n 



sup 

k<n 



2 + H k {i n ) — H k+ i(tn) 



n 



2 H k (i n ) 
< — h 2 sup 

n k<n n 



which converges in probability to as in (19). As a consequence 



B 



- SUp |C 2n t(tn) — H~ n {2t){^n) 



By (18), we get that: 
B n 



H 



n 



ng n (2t){^n, 



0<<<1 



n— >oo 



(^) <t<i under N( • | C = 1) 



Combining this with (19), we conclude that: 



CWiiftn)) — H nt (i n ) 

n n 



0<t<l 



(«0 

n— too 



{H t ,H t ) <t<i under N(-|C = 1). 



This completes the proof. 



□ 



Remark 3.5. If we see the tree in as a finite metric space using its graph distance, this theorem 
implies that t n , suitably rescaled, converges in distribution to the ^-stable tree, in the sense of the 
Gromov-Hausdorff distance on isometry classes of compact metric spaces (see e.g. |18l Section 
2] for details). 

Remark 3.6. When the mean value of \i is greater than one, it is possible to replace /x with a 
critical probability distribution belonging to the same exponential family as \i without changing 
the distribution of t n (see [14]). Consequently, the theorem holds in the supercritical case as 
well. When /x is subcritical and the radius of convergence of X^oMO^* ^ s greater than 1, this 
operation is still possible. The case where /x is subcritical and \x{i) ~ L{i)/i l+e as i — > oo has 
been treated in |16j . However, in full generality, the non-critical subcritical case remains open. 



3.3 Proof of the technical lemma 



In this section, we prove Lemma 3.2 



Proof of Lemma 3.2 . We first prove (i). By the local limit theorem (Theorem 1.9 (ii)), we have, 
for k > 1 and j 6 Z: 

B r . 



B n F[W n =j}- Pl 



where e(n) — >■ 0. By Proposition 1.5, we have ri(f) n (j) = jf[W n = —j]. Since J-pi (— 
qi we have for 1 < j < aB n : 

' 3 



n<i>n(j) - Qi 



B„ 



J_ 

B. n 



B n ¥\W n = -j)- Pl 



B, 



< ae(n). 
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This completes the proof of (i). 

For (ii), first note that by the definition of q s and the scaling property ([7]): 



ds q s . 

1 V D r 



i s 



i/e+v 



,i/e 



By Proposition 1.5 and the local limit theorem: 



m - E 



k=n ,UJ ^K V k 



kB k L 



k=n 



J 



Pl 



J_ 

Bk 



k=n 



kB k 



where e(n) — y 0. Then write: 



kB k 



J_ 
B k 



- j/fl n / j/B n 



</ * 



l/e+i 



Pi 



5 



[nsj . 



J/B r , 



, rfS s l/fl+l 



Pl 



B 



[nsj . 



Pl 



Denote the first term of the right-hand side by P(n,j) and the second term by Q(n,j) . Since 
Pi is bounded by a constant which we will denote by M, we have for 1 < j ' < aB n : 



r°° 1 
P{n, 3 )<aMj i ds 



n 



B n s l l e+l 



B\n S \ [ns\ 



1.6 



the same 



For fixed s > l, sl/ g +1 ^"^j^j 1 — 1 tends to as n — y oo, and using Proposition 

quantity is bounded by an integrable function independent of n. The dominated convergence 
theorem thus shows that P(n,j) — y uniformly in 1 < j < aB n . Let us now bound Q(n,j) for 
1 < j < aB n . Since the absolute value of the derivative of p\ is bounded by a constant which we 
will denote by M', we have: 



Q(n,j) < M> 



,10+1 



J 



j/B r . 



B 



[ns\ 



B n s l 



B 



- 1 



\ns\ 



The right-hand side tends to by the same argument we used for P(n,j). We have thus proved 
that: 



lim sup 



l<j<aB n 



E 

k=n 



J 



kB k 



Pi 



k 

Bk 



ds q s 



J_ 
B, 



0. 



One finally shows that ^ 
that: 



fc=« kB^ e (ty tends to as n —y oo uniformly in 1 < j < aB n by noticing 



sup sup 



E 



™>1 !<3<aB n \ k=n 

This completes the proof. 



kB u 



< a sup ( r^!r I < x . 

\k=n 



n>l 



kB k 



□ 
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